A Study of Intensional Concept Drift in Trending DBpedia Concepts

نویسندگان

  • Albert Meroño-Peñuela
  • Efstratios Kontopoulos
  • Sándor Darányi
  • Yiannis Kompatsiaris
چکیده

Concept drift refers to the phenomenon that concepts change their intensional composition, and therefore meaning, over time. It is a manifestation of content dynamics, and an important problem with regard to access and scalability in the Web of Data. Such drifts go back to contextual influences due to social embedding as suggested by e.g. topic analysis, news detection, and trends in social networks. Using DBpedia as a source of timestamped Linked Open Data, we analyze the interaction between a sample of popular keywords, as recorded by Google Trends, and their respective concept drifts in DBpedia. For the latter task, we deploy SemaDrift, an ontology evolution platform for detecting and measuring content dislocation dependent on context modification. Our hypothesis is that social embedding and awareness is an important trigger for concept drift in crowdsourced knowledge bases on the Web.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Expansion of Tail Concept Using Web Tables

Human-curated knowledgebases like Freebase and DBPedia cover popular concepts such as persons, organizations and locations, but many more specific concepts fall into the long tail outside current knowledgebases, such as acidic fruits, HD video formats and renewable resources. These concepts are found in conceptentity pairs automatically extracted from text documents, but they cover a limited nu...

متن کامل

What Is Concept Drift and How to Measure It?

This paper studies concept drift over time. We first define the meaning of a concept in terms of intension, extension and label. We then introduce concept drift over time and two derived notions: (in)stability over a time period and concept shift between two time points. We apply our framework in three case-studies, one from communication science, on DBPedia, and one in the legal domain. We des...

متن کامل

Concept drift and how to identify it

This paper studies concept drift over time. We first define the meaning of a concept in terms of intension, extension and label. Then we study concept drift over time using two theories: one based on concept identity and one based on concept morphing. A qualitative toolkit for analysing concept drift is proposed to detect concept shift and stability when concept identity is available, and conce...

متن کامل

Evolutionary Conceptual Clustering Based on Induced Pseudo-Metrics

We present a method based on clustering techniques to detect possible/probable novel concepts or concept drift in a Description Logics knowledge base. The method exploits a semi-distance measure defined for individuals, that is based on a finite number of dimensions corresponding to a committee of discriminating features (concept descriptions). A maximally discriminating group of features is ob...

متن کامل

Detecting Concept Drift in Data Stream Using Semi-Supervised Classification

Data stream is a sequence of data generated from various information sources at a high speed and high volume. Classifying data streams faces the three challenges of unlimited length, online processing, and concept drift. In related research, to meet the challenge of unlimited stream length, commonly the stream is divided into fixed size windows or gradual forgetting is used. Concept drift refer...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017